Coex-rank: an approach for microarray combined analysis - applications to PPARγ related datasets
نویسنده
چکیده
Microarrays have been widely used to study differential gene expression at the genomic level. They can also provide genome-wide co-expression information. Robust approaches are needed for integration and validation of independently-collected datasets which may contribute to a common hypothesis. Previously, attempts at meta-analysis have contributed to solutions to this problem. As an alternative, for microarray data from multiple highly similar biological experimental designs, a more direct combined approach is possible. In this thesis, a novel approach is described for microarray combined analysis, including gene-level unification into a virtual platform followed by normalization and a method for ranking candidate genes based on co-expression information – called Coex-Rank. We applied this approach to our Sppar (a PPARγ mutant) dataset, which illustrated an improvement in statistical power and a complementary advantage of the Coex-Rank method from a biological perspective. We also performed analysis to other PPARγ-related microarray datasets. From the perspective of gene sets, we observed that up-regulated genes from mice treated with the PPARγ ligand rosiglitazone were significantly down-regulated in mice with a global knock-in dominant-negative mutation of PPARγ. Integrated with publicly available PPRE (PPAR Response Element) datasets, we found that the genes which were most upregulated by rosiglitazone treatment and which were also down-regulated by the global knock-in mutation of PPARγ were robustly enriched in PPREs near transcription start sites. In addition, we identified several potential PPARγ targets in the aorta and mesenteric artery for further experimental validation, such as Rhobtb1 and Rgs5.
منابع مشابه
Integration and Reduction of Microarray Gene Expressions Using an Information Theory Approach
The DNA microarray is an important technique that allows researchers to analyze many gene expression data in parallel. Although the data can be more significant if they come out of separate experiments, one of the most challenging phases in the microarray context is the integration of separate expression level datasets that have gathered through different techniques. In this paper, we prese...
متن کاملSFLA Based Gene Selection Approach for Improving Cancer Classification Accuracy
In this paper, we propose a new gene selection algorithm based on Shuffled Frog Leaping Algorithm that is called SFLA-FS. The proposed algorithm is used for improving cancer classification accuracy. Most of the biological datasets such as cancer datasets have a large number of genes and few samples. However, most of these genes are not usable in some tasks for example in cancer classification....
متن کاملA hybrid filter-based feature selection method via hesitant fuzzy and rough sets concepts
High dimensional microarray datasets are difficult to classify since they have many features with small number ofinstances and imbalanced distribution of classes. This paper proposes a filter-based feature selection method to improvethe classification performance of microarray datasets by selecting the significant features. Combining the concepts ofrough sets, weighted rough set, fuzzy rough se...
متن کاملModification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis
Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...
متن کاملDeep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning
Domain adaptation is a powerful technique given a wide amount of labeled data from similar attributes in different domains. In real-world applications, there is a huge number of data but almost more of them are unlabeled. It is effective in image classification where it is expensive and time-consuming to obtain adequate label data. We propose a novel method named DALRRL, which consists of deep ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010